SYNOPSIS
Clusters data using the k-means algorithm. Either the Euclidean distance (default) or the Manhattan distance can be used. If the Manhattan distance is used, centroids
are computed as the component-wise median rather than the mean.
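
The difference between the two centroid rules can be illustrated with a minimal sketch. This is not the clusterer's actual implementation: the function name `kmeans`, its parameters, and the random initialization are assumptions made for illustration only.

```python
import random
from statistics import mean, median


def euclidean(a, b):
    # Straight-line distance between two points.
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def manhattan(a, b):
    # Sum of absolute component-wise differences.
    return sum(abs(x - y) for x, y in zip(a, b))


def kmeans(points, num_clusters, max_iterations=100, use_manhattan=False, seed=0):
    """Hypothetical k-means sketch; points is a list of equal-length tuples."""
    distance = manhattan if use_manhattan else euclidean
    # Manhattan distance pairs with the component-wise median,
    # Euclidean distance with the component-wise mean.
    center = median if use_manhattan else mean

    rng = random.Random(seed)
    # Assumption: initial centroids are distinct points drawn at random.
    centroids = [tuple(p) for p in rng.sample(points, num_clusters)]
    assignments = None

    for _ in range(max_iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_assignments = [
            min(range(num_clusters), key=lambda c: distance(p, centroids[c]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # no point changed cluster, so the result is stable
        assignments = new_assignments

        # Update step: recompute each non-empty cluster's centroid.
        for c in range(num_clusters):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                centroids[c] = tuple(center(col) for col in zip(*members))

    return centroids, assignments


# One cluster with an outlier: the median ignores the outlier, the mean does not.
data = [(0,), (1,), (100,)]
print(kmeans(data, 1, use_manhattan=True)[0])   # centroid is the median, (1,)
print(kmeans(data, 1, use_manhattan=False)[0])  # centroid is the mean
```

With a single cluster the outcome does not depend on the initial centroid, which makes the effect of the centroid rule easy to see in isolation.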
OPTIONS
- distanceFunction -- The distance function used to compare instances (default: Euclidean distance).
- maxIterations -- The maximum number of iterations.
- dontReplaceMissingValues -- If set, missing values are not replaced; by default they are replaced globally with the mean/mode.
- numClusters -- The number of clusters.
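
Global mean/mode replacement of missing values can be sketched as follows. This is only an illustration of the idea, not the tool's own code: the function name `replace_missing_values` and the use of `None` to mark a missing value are assumptions.

```python
from statistics import mean, mode


def replace_missing_values(rows):
    """Fill None entries with the column mean (numeric) or mode (nominal)."""
    fills = []
    for col in zip(*rows):
        present = [v for v in col if v is not None]
        if not present:
            fills.append(None)  # nothing to estimate from; leave as missing
        elif all(isinstance(v, (int, float)) for v in present):
            fills.append(mean(present))  # numeric attribute: global mean
        else:
            fills.append(mode(present))  # nominal attribute: most common value
    return [
        tuple(fill if v is None else v for v, fill in zip(row, fills))
        for row in rows
    ]


rows = [(1.0, "a"), (None, "b"), (3.0, None), (2.0, "b")]
for row in replace_missing_values(rows):
    print(row)
```

The fill values are computed once over the whole dataset, before clustering, which is what "globally" refers to here.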